|
Close Help |
In College Town, Pizza Company gets a lot of business. They get so many
calls each night that they have to operate three kitchens to fill all the
orders. They decided to spread the kitchens out so that each one is near one
of the housing sections of the university. Since the same people own all
three branches of Pizza Company, they only hired one set of delivery drivers
to serve all three kitchens. After a driver makes a delivery, he or she goes
to the nearest kitchen to pick up the next order. Therefore, the location of
a delivery person's next order depends only on his or her present location.
The kitchens are logically named according to their area of campus. Of the
calls to kitchen A, 30% are delivered in area A, 30% go to area B, and 40%
go to area C. Of the orders placed at kitchen B, 40%
go to area A, 40% go to area B, and 20%
go to area C. Of the calls to kitchen C, 50%
go to area A, 30% go to area B, and 20 go to area C.
The picture below depicts the situation.
As you might guess, this information will be easier to read if we write it
in matrix form. We will call this matrix S because it expresses the
probability of movement (transition) from one state to
the next. A state is the condition or location of an object in the
system at a particular time. Therefore, our diagram is called a state
diagram. Matrices of this type are called transition matrices. Our labeled transition matrix looks like
this:
For each element, Sij, i represents the starting location, and j represents the ending location for that move. This means that the row is the beginning location, and the column is the ending location after one move. We will want to learn things about what will happen in the future to Pizza Company, and this situation has the attributes necessary for what is called a Markov chain. Therefore, we will model the problem as a Markov chain in order to obtain information about the future. A problem can be considered a (homogeneous) Markov chain if it has the following properties:
Remark 19 Requirement (c) is not a requirement of Markov chains in general. It is a requirement for a special kind of Markov chain called a homogeneous Markov chain. We will be studying only homogeneous Markov chains in this book, so we will use the term Markov chain to refer to a homogeneous Markov chain. The transition matrix used to model the Markov chain will have the following properties:
Remark 20 Some assumptions are not completely accurate, but if we did not make assumptions to generalize the problem, we would not have the abitlity to approximate a solution to the problem. We just need to make sure that our assumptions are reasonable. The assumption that each delivery takes the same amount of time is reasonable if you consider that the average delivery times should be nearly equal.Now that you know the background, we can begin the fun part. Do you know what the probability matrix would look like that describes where a car would be after exactly 2 deliveries? What about 3, 4, or 5 deliveries? Can you predict the probability matrix for the cars after a night of deliveries?
Well, lets start with a simpler question. If you begin at kitchen C, what is
the probability that you will be in area B after 2 deliveries? Think about
how you can get to B. We can go from C to C, then from C to B. We can go
from C to B, then from B to B. We can go from C to A, then from A to B. We
will let P(CB) represent the probability of going from C to B in one
delivery. Let's write this probability in our shorthand notation. Do you
remember how probabilities work? If two (or more) independent events must
both (all) happen, we multiply their probabilities together. If there are
two (or more) distinct events that would both (all) work, we add the
probabilities of those events together.
Remark 21 Notice that we said we can multiply probabilities together if the events are independent. We know that our events are independent of one another becasue someone in area A is equally likely to order a pizza whether or not someone in area B or C ordered pizza. If our events were not independent, we would not be able to simply multiply the probabilities.This gives us P(CA)P(AB) + P(CB)P(BB) + P(CC)P(CB) for the probability that a delivery person goes from C to B in 2 deliveries. Let us substitute numbers into our formula. We get (.5)(.3) + (.3)(.4) + (.2)(.3) = .33. This tells us that if we begin at kitchen C, we have a 33% chance of being in kitchen B after 2 deliveries. Let us try this for another pair. If we begin at kitchen B, what is the probability of being at kitchen B after 2 deliveries? Try this before you read further! The probability of going from kitchen B to kitchen B in two deliveries is P(BA)P(AB) + P(BB)P(BB) + P(BC)P(CB) = (.4)(.3)+(.4)(.4) + (.2)(.3) = .34. Now it wasn't so bad calculating where you would be after 2 deliveries, but what if you need to know where you will be after 5 deliveries? That would take a LONG time. There must be an easier way, right? Look carefully at where these numbers come from. Do they fall into rows or columns? As you might suspect, they are the result of matrix multiplication. Going from C to B in 2 deliveries is the same as taking the inner product of row 3 and column 2. Going from B to B in 2 deliveries is the same as taking the inner product of row 2 and column 2. If you multiply S by S, you will get the same answers that you got for these two questions and the rest of the probability matrix after 2 deliveries.
You will notice that the elements on each row still add to 1 and each element is between 0 and 1, inclusive. Since we are modeling our problem with a Markov chain, this is essential. This matrix indicates the probabilities of going from kitchen i to kitchen j in exactly 2 deliveries. Now that we have this matrix, it should be easier to find where we will be after 3 deliveries. We will let p(AB) represent the probability of going from A to B in 2 deliveries. Let's find the probability of going from C to B in 3 deliveries. The probability is p(CA)P(AB) + p(CB)P(BB) + p(CC)P(CB) = (.37)(.3) + (.33)(.4) + (.3)(.3) = .333. Where do these numbers come from? This probability is the inner product of row three of S2 and column two of S. Therefore, if we multiply S2 by S, we will get the probability matrix for 3 deliveries.
By now, you probably know how we find the matrix of probabilities for 4, 5 or more deliveries. Notice that the elements on each row still add to 1. Therefore, it is important that you do not round your answers. Keep as many decimal places as possible to retain accuracy.
Remark 22 If all the entries of the transition matrix are between 0 and 1 EXCLUSIVELY, then convergence is guaranteed to take place. Convergence may take place when 0 and 1 are in the transition matrix, but convergence is no longer guaranteed. For an example, look at the matrix Think about the situation that this marix represents in order to understand why Ak oscillates as k grows.Sometimes, you will be given a vector of initial distributions to describe how many or what part of the objects are in each state in the beginning. Using this vector, you can find out how many or what part of the objects are in each state at any later time. If the initial distribution vector is a vector of decimals, it tells what part of the total number of objects are in each state in the beginning. It contains only numbers between 0 and 1, inclusive, and the elements in the row sum to one. Alternatively, the vector of initial distributions could contain the actual number of objects or people in each state in the beginning. In this case, all the elements will be nonnegative and the elements in each row will add to the total number of objects or people in the entire system. For our example, the vector of initial distributions can tell you what part of the drivers originally begin in each area. If we start out with a uniform distribution, we will have of our delivery cars in each area. After one delivery, the distribution will be 40% of our deliveries in area A, in area B, and in area C. We find this by multiplying our initial distribution matrix by our transition matrix.
After the entire evening, we said that the fractions would converge to
particular numbers so that the area from which we start doesn't matter.
After many deliveries, we will obtain the same right-hand side no matter
with which initial distribution we start. For example,
Notice that the right-hand side is the same as one of the rows of our transition matrix after many deliveries. This is exactly what we expected because we said that of the people will be in area A after many deliveries regardless of what percentage of the people were in area A in the initial distribution. Check this with several initial distributions to convince yourself of the truth of this statement.
If the initial distribution indicates the actual number of people in the
system, our system can be represented by the following after one delivery:
Did you notice that we now have a fractional number of people in areas A and
C after one delivery? We know that this cannot happen, but this gives us a
good idea of approximately how many delivery people are in each area. After
many deliveries, the right-hand side of this equality will also be very
close to a particular vector. For example,
The particular vector that the product converges to is the total number of
people in the system (54 in this case) times any row of the matrix that Ak
converges to as k grows,
Try some examples to convince yourself that the vector indicating the number
of people in each area after many deliveries will not change if people are
moved from one state to another in the initial distribution. Also notice
that the number of people in the entire system never changes. People move
from place to place, but the system never loses or gains people.
Remark 23 It is usual to associate the word vector with a column vector, so a row vector is a transposed vector. Therfore, we will write
Remark 24 Some authors set up transition matrices so that j represents the starting location and i represents the ending location. In these cases the columns add to one. For this case, the entire equation is transposed, so instead of xTAk = bT where x is the column vector of initial distribution and b is the column vector of distribution after k steps, the equation is (Ak)Tx = b. |
Send comments on material to Tamara Carter
These pages are maintained by Hilena Vargas
(hvargas@rice.edu)
Updated: September 13, 2000
Copyright ©1995 - 2000 Tamara Lynn Anthony